Extraction of correlated gene clusters from multiple genomic data by generalized kernel canonical correlation analysis
نویسندگان
چکیده
MOTIVATION A major issue in computational biology is the reconstruction of pathways from several genomic datasets, such as expression data, protein interaction data and phylogenetic profiles. As a first step toward this goal, it is important to investigate the amount of correlation which exists between these data. RESULTS These methods are successfully tested on their ability to recognize operons in the Escherichia coli genome, from the comparison of three datasets corresponding to functional relationships between genes in metabolic pathways, geometrical relationships along the chromosome, and co-expression relationships as observed by gene expression data.
منابع مشابه
Kernel Canonical Correlation Analysis1
This paper introduces a new non-linear feature extraction technique based on Canonical Correlation Analysis (CCA) with applications in regression and object recognition. The non-linear transformation of the input data is performed using kernel-methods. Although, in this respect, our approach is similar to other generalized linear methods like kernel-PCA, our method is especially well suited for...
متن کاملKernel Generalized Canonical Correlation Analysis
A classical problem in statistics is to study relationships between several blocks of variables. The goal is to find variables of one block directly related to variables of other blocks. The Regularized Generalized Canonical Correlation Analysis (RGCCA) is a very attractive framework to study such a kind of relationships between blocks. However, RGCCA captures linear relations between blocks an...
متن کاملA Framework for 3D Object Recognition Using the Kernel Constrained Mutual Subspace Method
This paper introduces the kernel constrained mutual subspace method (KCMSM) and provides a new framework for 3D object recognition by applying it to multiple view images. KCMSM is a kernel method for classifying a set of patterns. An input pattern x is mapped into the high-dimensional feature space F via a nonlinear function φ, and the mapped pattern φ(x) is projected onto the kernel generalize...
متن کاملGraph-Driven Feature Extraction From Microarray Data Using Diffusion Kernels and Kernel CCA
We present an algorithm to extract features from high-dimensional gene expression profiles, based on the knowledge of a graph which links together genes known to participate to successive reactions in metabolic pathways. Motivated by the intuition that biologically relevant features are likely to exhibit smoothness with respect to the graph topology, the algorithm involves encoding the graph an...
متن کاملRegularized Generalized Canonical Correlation Analysis Extended to Symbolic Data
Regularized Generalized Canonical Correlation Analysis (RGCCA) is a component-based approach which aims at studying the relationship between several blocks of numerical variables. In this paper we propose a method called Symbolic Generalized Canonical Correlation Analysis (Symbolic GCCA) that extends RGCCA to symbolic data. It is a versatile tool for multi-block data analysis that can deal with...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- Bioinformatics
دوره 19 Suppl 1 شماره
صفحات -
تاریخ انتشار 2003